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In parentheses, the number of long-range constraints, SI, S2, . . . S5, various sets of constraints for 
18-56 residue 6pti fragment. 
Coordinate root mean square deviation between crystallographic coordinates of Ca's and the 

approximate positions of the model Ca's calculated as R^j = (4r, + r;.] + r^yS. 

MI, misfolded structure that has satisfied the long-range constraints, generally the topological 
mirror image fold. 



In the above cases, based on the conformational energy of just one (the last 
in a trajectory) snapshot, it was possible in all cases to identify the proper fold. 
However, it should be also noted that this very simple criterion may not always 
work. Indeed, in the case of the fourth set (S4) of long-range constraints for 6pti, the 
difference between the energy of a misfolded state and the lowest energy of properly 
folded states (simulation #3) was marginal. Moreover, the three remaining properly 
folded conformations have a higher energy than the misfolded one does. 
Fortunately, for bigger proteins, the situation is different. The energy gap between 
the proper fold and misfolded states is usually quite large, except for the cases where 
substantial all of the protein (or particular protein domain) has a P-conformation, 
and thus has a smallest number of long-range constraints. 

The reasons for the apparently lower reliability of the (3-protein prediction 
are complex, and several long-range constraints are required before complex P-type 
natural proteins can be folded. These proteins have a larger number of building 
blocks (compare the number of P-strands in an all-p protein with the number of 
helices in a hehcal structure of similar size) and, consequently, more complex folds. 
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